Reinforcement Learning Inspired Disturbance Rejection and Nao Bipedal Locomotion

نویسنده

  • Bernhard Hengst
چکیده

Competitive bipedal soccer playing robots need to move fast and react quickly to changes in direction while staying upright. This paper describes the application of reinforcement learning to stabilise a flat-footed humanoid robot. An optimal control policy is learned using a physics simulator. The learned policy is supported theoretically and interpreted on a real robot as a linearised continuous control function. The paper also describes other components, including foot-step coordination, of bipedal locomotion integrated to achieve reactive omni-directional locomotion for Nao robots used in the RoboCup Standard Platform League.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Bipedal Locomotion Primitive Learning, Control and Prediction from Human Data

At the current stage bipedal robot locomotion is quite different from human walking. Imitation learning framework from human demonstrations is an efficient approach to lead towards human-like behaviors. This paper addresses a framework for real-time wholebody human motion imitation by a humanoid robot. The framework is a structured mixture of whole body motion control, learning and prediction. ...

متن کامل

Lateral Disturbance Rejection for the Nao Robot

Maintaining balance in the presence of disturbances is crucial for bipedal robots. In this paper, we focus on the lateral motion component. In order to attain disturbance rejection and to quickly recover balance, we combine three different control approaches. As a principal building block, we generate center of mass trajectories with a linear model predictive controller that takes scheduled foo...

متن کامل

Feedback Control For Cassie With Deep Reinforcement Learning

Bipedal locomotion skills are challenging to develop. Control strategies often use local linearization of the dynamics in conjunction with reduced-order abstractions to yield tractable solutions. In these model-based control strategies, the controller is often not fully aware of many details, including torque limits, joint limits, and other non-linearities that are necessarily excluded from the...

متن کامل

Gait Generation for a Bipedal System By Morris-Lecar Central Pattern Generator

The ability to move in complex environments is one of the most important features of humans and animals. In this work, we exploit a bio-inspired method to generate different gaits in a bipedal locomotion system. We use the 4-cell CPG model developed by Pinto [21]. This model has been established on symmetric coupling between the cells which are responsible for generating oscillatory signals. Th...

متن کامل

Exploiting Human Motor Skills for Training Bipedal Robots Undergraduate Honors Thesis

Although machine learning, reinforcement learning, and learning from demonstration have improved the rate and accuracy at which robots can gain intelligence from humans, they haven’t reached the rapid rate at which humans are able to acquire new knowledge. Many systems that exploit imitation learning use simple positive and negative reinforcement, and place the burden of learning completely on ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015